# Synthesis on Atom Computation

Dingchao Gao

Institute of Software Chinese Academy of Sciences

June 6, 2024

## **Outline**

- 1. Compilation for Dynamically Field-Programmable Qubit Arrays with Efficient and Provably Near-Optimal Scheduling
- 2. Computational capabilities and compiler development for neutral atom quantum processors—connecting tool developers and hardware experts

#### **Related Works**

- Compiling Quantum Circuits for Dynamically
   Field-Programmable Neutral Atoms Array Processors: Utilizes Z3
   MST, but lacks scalability and fidelity considerations.
- FPQA-C: A Compilation Framework for Field Programmable
   Qubit Array: Employs a rule-based algorithm, offering good scalability,
   but does not achieve the optimal count of 2Q gates.



#### Overview

- Background: Quantum computing with neutral atoms has advanced rapidly.
- Fidelity:

$$f = (f_1)^{g_1} \cdot \overbrace{(f_2)^{g_2} \cdot (f_{\mathsf{exc}})^{|Q|S - 2g_2}}^{\mathsf{two-qubit gate}} \cdot \overbrace{(f_{\mathsf{trans}})^{N_{\mathsf{trans}}}}^{\mathsf{atom transfer}} \cdot \overbrace{\prod_{q \in Q} (1 - T_q/T_2)}^{\mathsf{decoherence}}.$$

- Significant scalability: experiments with up to 6,100 qubits.
- The compilation process is broken down into three tasks: scheduling, placement, and routing.

| Qubit Num | Gate Num | Scheduling | Placement | Routing | Codegen | Total  |
|-----------|----------|------------|-----------|---------|---------|--------|
| 30        | 45       | 0.0008     | 137.32    | 0.0057  | 0.0184  | 137.35 |
| 60        | 60       | 0.0017     | 141.23    | 0.0124  | 0.0379  | 141.28 |
| 90        | 135      | 0.0023     | 144.43    | 0.0304  | 0.0630  | 144.52 |

Table: Timing Results for Different Qubit and Gate Numbers

# **Scheduling**

- Scheduling is crucial for determining the sequence of operations. Graph edge coloring is used to model the scheduling problem.
- Each edge represents a two-qubit gate.
- Colors represent different stages.
- The goal is to minimize the number of stages while ensuring no two adjacent edges share the same color.



qubit interaction graph

| stage | gate                                                         |  |  |
|-------|--------------------------------------------------------------|--|--|
| 0     | g <sub>3</sub> g <sub>6</sub> g <sub>7</sub> g <sub>11</sub> |  |  |
| 1     | g <sub>0</sub> g <sub>5</sub> g <sub>9</sub> g <sub>12</sub> |  |  |
| 2     | g <sub>2</sub> g <sub>4</sub> g <sub>8</sub> g <sub>14</sub> |  |  |
| 3     | g <sub>1</sub> g <sub>10</sub> g <sub>13</sub>               |  |  |

edge-color schedule

# Scheduling: Graph Edge Coloring

#### **Theorem**

**Vizing's Theorem:** For any simple graph G with maximum degree  $\Delta$ , the chromatic index  $\chi'(G)$  satisfies:

$$\Delta \le \chi'(G) \le \Delta + 1$$

where  $\chi'(G)$  is the minimum number of colors needed to color the edges of G.

- There exists an algorithm with runtime  $O(|V| \cdot |E|)$  that provides an edge coloring  $\phi : E \to \{0, 1, 2, ..., \Delta(G)\}$ .
- The maximum gate count is  $\binom{n}{2}$ . Thus, the time complexity of scheduling is  $O(n^3)$ .

## **Placement**

- Placement refers to assigning qubits to physical locations.
- Optimal placement minimizes the distance between interacting qubits.
- This reduces the need for long-distance routing, which can lower fidelity.

#### **Formula**

$$\sum_{g(q,q')\in G} w_g \cdot \mathsf{dist}(m(q),m(q'))$$

where  $w_g$  is the weight for gate g, m is the placement function from qubits to interaction sites, and dist is the **Euclidean distance**.

# **Placement Strategies**

- Use of simulated annealing algorithms to find near-optimal solutions with a constant runtime.
- Balancing between computational efficiency and placement quality.

$$x \in \left[0, \max\left(\lfloor \sqrt{n} \rfloor + 4, x_{\max}\right)\right], y \in \left[0, \max\left(\lfloor \sqrt{n} \rfloor + 4, y_{\max}\right)\right]$$

#### **Formula**

$$w_{g} = egin{cases} 1, & ext{static placement} \\ \max \left(0.1, 1 - 0.1 s_{g} 
ight), & ext{dynamic placement} \end{cases}$$

# Placement example



## Routing

- Routing involves determining paths for qubits to move during computation. Ensuring minimal delay and avoiding congestion are key goals.
- A conflict graph represents the conflicts during the routing process.
- Nodes represent qubits movements, and edges represent conflicts.
- The problem is The Maximum Independent Set (MIS) problem, where one seeks to find the largest set of vertices in a graph such that no two vertices in the set are adjacent.



# **Greedy Algorithm for Bounded Degree Graphs**

- 1). putting all vertices in a list (sorted by distance).
- 2). adding the first vertex to the IS.
- 3). removing all its neighbors from the list, and continuing 2-3.

#### **Theorem**

In bounded degree graphs, there are effective approximation algorithms with constant ratios. For example, a greedy algorithm that forms a maximal independent set by repeatedly choosing the vertex with the minimum degree and removing its neighbors achieves an approximation ratio of  $(\Delta+2)/3$  for graphs with maximum degree  $\Delta$ . Approximation hardness bounds for these cases were shown by Berman and Karpinski (1999).

# **Routing Complexity**

For the number of qubits n, the maximum number of gates is n/2, so the number of vertices is at most n ( $|V| \le n$ ):

- Checking conflicts for all pairs of vertices requires  $O(|V|^2)$  time.
- Sorting the vertices requires  $O(|V| \log |V|)$  time.
- The greedy algorithm requires  $O(|V|^2)$  time. In the worst case, the greedy algorithm needs to be run O(|V|) times.
- In total, there can be O(n) Rydberg stages, resulting in a routing time of  $O(n^4)$ .

Only construct a graph on the first K vertices in the lists (|V| = K):

• The windowed routing takes  $O(n^2 \log n + n^2 K^2)$ .

# **Results and Comparison**

- The compiler, Enola, shows significant improvements in performance.
- Achieves 3.7X stage reduction compared to existing works.
- Demonstrates 5.9X improvement in fidelity on benchmark sets.
- Highly scalable, capable of compiling circuits with up to 10,000 qubits within 30 minutes.
- Outperforms the current state of the art, OLSQ-DPQA.

#### **Conclusion**

- The compilation process for dynamically field-programmable qubit arrays involves scheduling, placement, and routing.
- The method provide near-optimal solutions for scheduling  $(S_{opt} + 1)$  and efficient strategies for placement and routing.
- Enola compiler achieves significant improvements in stage reduction and fidelity.
- Future work includes further optimization and exploring additional constraints.
- Open source availability: https://github.com/UCLA-VAST/Enola

#### **Outline**

- 1. Compilation for Dynamically Field-Programmable Qubit Arrays with Efficient and Provably Near-Optimal Scheduling
- 2. Computational capabilities and compiler development for neutral atom quantum processors—connecting tool developers and hardware experts

# **Compilation subroutines**

## **Definition (Synthesis)**

Given a quantum computation  $U\in\mathbb{C}^{2^n\times 2^n}$  and the native platform gate set  $\Sigma^{\text{native}}$ , synthesis is the task to find a gate sequence

$$\tilde{U} = g_{N-1} \circ \cdots \circ g_0$$

with all  $g_0,\ldots,g_N\in\Sigma^{\text{native}}$  and  $U=\tilde{U}$  up to some small error.

## **Definition (Mapping)**

Given a quantum circuit  $U=g_{N-1}\circ\cdots\circ g_0$  on circuit qubits  ${\bf Q}$  and a hardware configuration with physical qubits  ${\bf P}$  and coupling map  $G({\bf P},{\bf E})$ . The task of *mapping* is to find a bijective function  $f:{\bf Q}\to{\bf P}$  and an insertion of MOVE and SWAP operations such as

$$U = \cdots \circ \mathsf{MOVE}(q_i) \circ \mathsf{SWAP}(q_i, q_k) \circ g(q_i, q_i) \circ \cdots$$

## **Definition (Scheduling)**

Given a quantum circuit U and its corresponding DAG representation D, the objective of scheduling is to determine the optimal timing for the gates to be executed while preserving the integrity of the DAG up to commutation rules.

# Figures of merit

- Gate count
- Operations count
- Fidelity and runtime

$$P(U) = \exp\left(-\frac{t_{\mathsf{idle}}}{T_{\mathsf{eff}}}\right) \prod_{i=0}^{\tilde{N}} \mathcal{F}_{O_i}, \quad U = O_{N-1} \circ \cdots \circ O_0$$

# **Atom computation capabilities**

